Robust Tests in Online Decision-Making
نویسندگان
چکیده
Bandit algorithms are widely used in sequential decision problems to maximize the cumulative reward. One potential application is mobile health, where goal promote user's health through personalized interventions based on user specific information acquired wearable devices. Important considerations include type of, and frequency with which data collected (e.g. GPS, or continuous monitoring), as such factors can severely impact app performance users’ adherence. In order balance need collect that useful constraint of impacting performance, one needs be able assess usefulness variables. feedback sequentially correlated, so traditional testing procedures developed for independent cannot apply. Recently, a statistical procedure was actor-critic bandit algorithm. An algorithm maintains two separate models, actor, action selection policy, other critic, reward model. The well validity test guaranteed only when critic model correctly specified. However, misspecification frequent practice due incorrect functional form missing covariates. this work, we propose modified robust derive novel actor parameters case.
منابع مشابه
Provenance for Online Decision Making
It is commonly believed that provenance can be utilised to form assessments about the quality, reliability or trustworthiness of data. Once presented with contradictory or questionable information, users can seek further validation by referring to its provenance. While there has been some effort to design principled methods to analyse provenance, the focus has mostly been on offline use of prov...
متن کاملRobust Multi-Stage Decision Making
Testifying to more than ten years of academic and practical developments, this tutorial attempts to provide a succinct yet unified view of the robust multi-stage decision making framework. In particular, the reader should better understand: (1) the distinction between static versus fully or partially adjustable decisions; (2) the root of tractability issues; (3) the connection to robust dynamic...
متن کاملRobust Decision-making Under Ambiguity
Most of management research, following on the paradigm of expected utility theory, has developed complex models of optimal managerial action in the presence of uncertainty. Still, the assumption that managers assign probabilities to outcomes, and consequently optimize their actions has come under criticism from a number of empirical studies. Researchers, working on probability assessments, have...
متن کاملOnline Decision Making in VR Application Environments
The aim of this paper is to understand the process by which consumers’ perception of online VR environments impact their purchase decision. Combining factor and process models, we propose a transaction framework suggestive of consumer decision-making in VR e-commerce environments. The framework is informed by theory to be validated by an experimental design to understand the antecedents and con...
متن کاملOnline Decision-Making in General Combinatorial Spaces
We study online combinatorial decision problems, where one must make sequential decisions in some combinatorial space without knowing in advance the cost of decisions on each trial; the goal is to minimize the total regret over some sequence of trials relative to the best fixed decision in hindsight. Such problems have been studied mostly in settings where decisions are represented by Boolean v...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i9.21240